Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing

نویسندگان

  • Graham A. Heap
  • Jennie H.M. Yang
  • Kate Downes
  • Barry C. Healy
  • Karen A. Hunt
  • Nicholas Bockett
  • Lude Franke
  • Patrick C. Dubois
  • Charles A. Mein
  • Richard J. Dobson
  • Thomas J. Albert
  • Matthew J. Rodesch
  • David G. Clayton
  • John A. Todd
  • David A. van Heel
  • Vincent Plagnol
چکیده

Many disease-associated variants identified by genome-wide association (GWA) studies are expected to regulate gene expression. Allele-specific expression (ASE) quantifies transcription from both haplotypes using individuals heterozygous at tested SNPs. We performed deep human transcriptome-wide resequencing (RNA-seq) for ASE analysis and expression quantitative trait locus discovery. We resequenced double poly(A)-selected RNA from primary CD4(+) T cells (n = 4 individuals, both activated and untreated conditions) and developed tools for paired-end RNA-seq alignment and ASE analysis. We generated an average of 20 million uniquely mapping 45 base reads per sample. We obtained sufficient read depth to test 1371 unique transcripts for ASE. Multiple biases inflate the false discovery rate which we estimate to be approximately 50% for random SNPs. However, after controlling for these biases and considering the subset of SNPs that pass HapMap QC, 4.6% of heterozygous SNP-sample pairs show evidence of imbalance (P < 0.001). We validated four findings by both bacterial cloning and Sanger sequencing assays. We also found convincing evidence for allelic imbalance at multiple reporter exonic SNPs in CD6 for two samples heterozygous at the multiple sclerosis-associated variant rs17824933, linking GWA findings with variation in gene expression. Finally, we show in CD4(+) T cells from a further individual that high-throughput sequencing of genomic DNA and RNA-seq following enrichment for targeted gene sequences by sequence capture methods offers an unbiased means to increase the read depth for transcripts of interest, and therefore a method to investigate the regulatory role of many disease-associated genetic variants.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Allelic imbalances in human bladder cancer: genome-wide detection with high-density single-nucleotide polymorphism arrays.

BACKGROUND Bladder cancer is characterized by genomic instability. In this study, we investigated whether genome-wide screening using single-nucleotide polymorphism (SNP) arrays could detect allelic imbalance (loss or gain of at least one allele) in bladder cancers. METHODS For microarray analysis, DNA was isolated from microdissected bladder tumors and leukocytes from 11 patients. The stage ...

متن کامل

Chapter 6 Identi fi cation of Imprinted Loci by Transcriptome Sequencing

Enabled by high-throughput technologies that are capable of generating millions of sequencing reads, transcriptome sequencing is emerging as an important approach for mapping allelic imbalance (AI), where transcription is biased toward one allele in a diploid system. AI is identi fi ed by counting sequencing reads that map to genomic regions containing heterozygous SNPs, where the base identity...

متن کامل

Tumor Transcriptome Sequencing Reveals Allelic Expression Imbalances Associated with Copy Number Alterations

Due to growing throughput and shrinking cost, massively parallel sequencing is rapidly becoming an attractive alternative to microarrays for the genome-wide study of gene expression and copy number alterations in primary tumors. The sequencing of transcripts (RNA-Seq) should offer several advantages over microarray-based methods, including the ability to detect somatic mutations and accurately ...

متن کامل

Discovery of a large set of SNP and SSR genetic markers by high-throughput sequencing of pepper (Capsicum annuum).

Genetic markers based on single nucleotide polymorphisms (SNPs) are in increasing demand for genome mapping and fingerprinting of breeding populations in crop plants. Recent advances in high-throughput sequencing provide the opportunity for whole-genome resequencing and identification of allelic variants by mapping the reads to a reference genome. However, for many species, such as pepper ...

متن کامل

Genome-wide genetic characterization of bladder cancer: a comparison of high-density single-nucleotide polymorphism arrays and PCR-based microsatellite analysis.

Most human cancers are characterized by genomic instability, the accumulation of multiple genetic alterations, and allelic imbalance throughout the genome. Loss of heterozygosity (LOH) is a common form of allelic imbalance, and the detection of LOH has been used to identify genomic regions that harbor tumor suppressor genes and to characterize different tumor types, pathological stages and prog...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 19  شماره 

صفحات  -

تاریخ انتشار 2010